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ABSTRACT 

It  is  often  convenient  to  apply  a  nonlinear  model  developed  for  a  single 
unit  to  data  representing  the  average  response  for  several  units.  The 
relationship  between  the  parameters  of  the  aggregate  model  and  those  of  the 
individual  units  is  investigated  in  a  geometrical  framework,  for  a  general 

nonlinear  model.  The  aggregation  effect  derived  is  closely  related  to  the 

\ 


bias  in  nonlinear  estimation,  as  given  by  Box  (1971).  Unlike  the  bias, 
however,  the  aggregation  effect  may  be  of  comparable  magnitude  to  the  standard 
error  of  the  estimates.  The  theoretical  results  derived  are  verified 
empirically  in  an  application  to  a  model  of  residential  energy  consumption. 
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SIGNIFICANCE  AND  EXPLANATION 


Analysis  of  complex  physical  or  economic  systems  often  involves  the  use 
of  a  model  which  is  a  nonlinear  function  of  certain  coefficients  to  be 
estimated  from  data.  The  same  basic  model  may  describe  the  behavior  of 
several  different  units,  such  as  households  or  individuals}  however,  the 
values  of  the  coefficients  vary  from  unit  to  unit.  To  determine  the  model 
coefficients  it  may  be  impractical  to  obtain  data  from  each  of  the 
individuals.  Instead,  the  available  data  represent  averages  over  ail  the 
individuals.  When  the  basic  model  used  is  nonlinear,  estimates  based  on  these 
aggregate  data  will  be  systematically  different  from  those  which  would  be 
obtained  by  averaging  estimates  based  on  the  individual-unit  data. 
Understanding  this  difference  is  important  if  the  aggregate  results  are  to  be 
interpreted  as  representative  of  the  group  of  individuals.  This  paper  derives 
a  formula  for  the  systematic  difference,  and  gives  guidelines  for  determining 
if  this  effect  is  important  in  a  given  application. 


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  author  of  this  report. 
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1.  INTRPPVCTIPN 

Aggregate  data  are  collected  and  analyzed  in  a  wide 
variety  of  contexts.  Macroeconomic  and  sociological  studies 
provide  obvious  examples,  but  in  the  physical  sciences  as 
well  aggregate  data  may  be  relied  on  as  a  matter  of  neces¬ 
sity  or  convenience.  A  reasonable  approach  to  analyzing  data 
representing  the  average  response  of  several  units  is  to  fit 
the  data  to  the  same  model  which  describes  a  single  unit's 
response.  However,  if  this  model  is  nonlinear,  the  resulting 
parameter  values  are  different  from  those  which  would  be 
obtained  by  analyzing  the  units  separately  and  then  averag¬ 
ing.  Understanding  this  difference  Is  important  if  an  indi¬ 
vidual  unit  Is  to  be  compared  to  the  "typical M  embodied  in 
the  aggregate. 

In  this  paper,  a  simple  physical  model  of  household 
energy  demand  provides  the  reference  point  for  an  explora¬ 
tion  of  the  effect  of  aggregation  in  a  nonlinear  least 
squares  regression.  The  model  and  aggregation  are  described 
in  the  remainder  of  this  Introductory  section.  Section  2 
then  lays  out  the  geometry  of  the  general  nonlinear  least 
squares  regression,  following  Bates  and  Watts  (1980).  Dis¬ 
cussion  of  biases  begins  in  Section  3  with  a  review  of  M.  J. 
Box's  (1971)  calculation  of  the  bias  for  the  single-unit 
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case  (i.e. ,  without  aggregation).  In  Section  3,  Box's  work 
Is  Interpreted  In  the  geometrical  context  established  in 
Section  2.  Finally/  in  Section  4  the  aggregation  effect  is 
derived  and  related  to  Box's  results  for  the  single  unit, 
again  in  terms  of  geometry.  The  application  of  the  main 
result  in  Section  4  to  the  model  which  motivated  this  study 
is  presented  in  Section  5. 


1.1.  The  Residential  Energy  Consumption  Model 


This  study  is  the  outgrowth  of  a  broad-range  investiga¬ 
tion  of  energy  consumption  in  houses  which  has  taken  place 
over  the  past  ten  years.  (See,  eg.  Socolow  (1978)  and  Dutt 
et  al  (1982).)  A  basic  model  employed  in  the  household 
energy  studies  describes  a  house's  daily  fuel  consumption 

7  during  period  (or  month)  m  as  a  simple  function  of  the 

m 

N  daily  outdoor  temperature  readings  T  . : 

m  iu  j 


with 


Y  -  a  +  BX  (t)  +  e 

Q  Bl  Ol 


(l.i) 


N. 


v«  -  si  jV'V*  +  v 


(1.2) 


The  subscript  '+'  in  (1.2)  indicates  that  the  term  in 
parentheses  is  set  to  zero  if  negative.  The  truncated  sum 
X  (t)  is  the  average  degree-days  per  day,  base  t.  This 

B 

truncation  is  the  source  of  the  nonlinearity  in  the  model 

(1.1). 

Physically/  the  intercept  a  represents  baseload  con¬ 
sumption,  such  as  for  water  heating  and  cooking,  the  slope 
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0  is  a  heating  rate  (or  effective  heat  loss  rate)  and  the 
reference  temperature  x  is  the  outdoor  temperature  at 
which  the  furnace  first  comes  on.  The  parameters  ot,  3, 
and  x  are  estimated  by  the  method  of  least  squares,  using 
data  Y_  from  utility  billing  records  for  the  house. 

Changes  in  these  parameters,  when  correctly  interpreted, 
reflect  the  impact  of  conservation  measures  undertaken  in 
the  house. 

An  overall  conservation  index  T  is  obtained  from  the 
parameters  a,  3,  and  x  as 

T  -  365 (a  +  X  ( x ) )  (1.3) 

o 

where  XQ(x)  is  the  average  base-x  degree-days  per  day, 
over  a  base  period  of  several  years.  He  call  the  index  T 
the  normalized  annual  consumption. 

Aggregate  versions  of  (1.1)  and  (1.3)  have  been  used 
to  analyze  state-  and  utility-  wide  natural  gas  consumption 
in  the  residential  heating  sector  (Fels  and  Goldberg,  1982 
and  Goldberg  and  Fels,  1982).  These  analyses  represent  an 
aggregation  over  many  thousands  of  houses,  for  which  indivi¬ 
dual  data  are  not  available.  Another  potential  application 
of  the  aggregate  model  is  for  the  "scorekeeping"  of  large- 
scale  retrofit  programs.  If  the  model  (1.1)  is  to  be 
employed  in  assessing  the  total  savings  achieved  in  a  large 
number  of  houses,  then  modeling  the  aggregate  can  simplify 
the  scorekeeping  task.  One  purpose  of  this  study  is  to 
determine  the  adequacy  of  such  a  shortcut. 


Assume/  then,  that  we  have  U  different  housing  units, 
each  of  whose  consumption  is  described  by  (1.1)  with  param¬ 
eters  au,  0U,  TU,  and  errors  e^,  u*l,2,...,U.  Defining 

the  aggregate  consumption  Y  as  the  simple  average  of  con¬ 
sumption  in  the  U  houses,  we  have 

1  u  u  u  u 
Ym  “  U  J(ot  +  P  VT  >  +  em>-  (1-4> 

tie  will  fit  the  aggregate  Y  to  (1.1).  The  question  to  be 

m 

answered  in  this  paper  is,  how  are  the  resulting  parameter 
estimates  related  to  the  individual  parameters  au,  0U, 
and  tu?  The  answer  will  be  derived  for  a  general  nonlinear 
model,  then  applied  to  the  energy  example  of  our  illustra¬ 
tion. 


2.  EE  GEOMETRY  0£  NONLINEAR  LEAST  SQUARES 

The  general  nonlinear  model  with  unknown  p-dimenslonal 
parameter  8  can  be  written  in  matrix  form  as 

Y  ■  T}(0)  e  (2.1) 

E(e)  *  0 

E(e'e)  *  a2I. 

In  (2.1),  Y,  T|,  end  e  are  M-dimensional  vectors,  such 
that  r^,  the  mth  component  of  r|,  depends  on  observa¬ 
tions  Xjg  as  well  as  on  8.  For  the  heating  model  (1.1), 

6  ■  Ca  0  x3' ,  and  r)(8)  ■  a+0X<T),  where  X  is  the  M- 
dlmenslonal  vector  with  components  given  by  (1.2). 
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For  the  general  model,  aa  8  ranges  over  the  parameter 
space  8,  the  function  t)  sweeps  out  a  p-dlmensional 
expectation  surface  S  in  the  M-dimensional  sample  space: 

3  -  £rj(8) :  6e©l.  (2.2) 

He  assume  the  function  t)  to  be  twice  differentiable, 
and  define 


tj(0>  -  -|§t  *  \ 

L  jJMxp 


T]  (8) 


2  r  a2n  1 
an  =  |  8T1m  I 


(2.3) 


(2.4) 


aeae*  i  ae.ae.i 

L  x  ^  J  Mxpxp 

(Throughout  this  paper,  an  expression  in  square  brackets 
indicates  a  array  with  components  given  by  the  subscripted 
expression,  such  that  m*l,2,...,M;  i, j»l,2,. . . ,p. ) 


Me  then  write  the  tangent  vector  at  t)Oq)  in  the 
direction  v  as 


fcv  *  T)<e0,v- 

The  acceleration  vector  ay  is  given  by 

ay  ■  v* tj*  ( 8 ) v,  (2.5) 

where  the  mth  component  of  the  M-vector  av  is  the  qua- 

•  • 

dratlc  form  in  v  given  by  the  mth  face  of  r\  .  A  meas¬ 
ure  of  the  nonlinearity  of  the  model  is  provided  by  Bates' 
and  Matts'  (1980)  relative  curvature  y^,  defined  as 
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(2.6) 


iV 


It  I 
v 


2' 


where  the  scale  factor 


Below,  we  refer  to  the  relative  curvature  y  simply  as  the 
curvature. 

The  acceleration  can  he  decomposed  into  two  components 

a^  and  a£,  which  are  respectively  parallel  and  normal  to 

the  tangent  plane  at  t\(Qq).  Substituting  the  tangential 

a^  ^  and  normal  a-^-  for  a  in  (2.6)  gives  analogous  defin- 
v  v  v 

Itlons  of  the  tangential  and  normal  curvatures  y* ;  and 

y^-,  respectively.  Noting  that  the  tangential  acceleration 

component  is  caused  by  the  parameterization  chosen,  while 

the  normal  component  is  Independent  of  this  choice.  Bates 

and  Hatts  refer  to  the  tangential  y^*  as  the  "parameter- 

effects"  curvature,  and  to  the  normal  y^  as  the  "intrinsic 

v 

curvature."  As  will  be  seen  in  the  next  section,  biases  in 
parameter  estimates  relate  directly  to  the  parameter-effects 
curvature  y^*  while  biases  in  estimating  t)(6),  the  con¬ 
ditional  expectation  of  7  given  the  data  X,  relate  to  the 
intrinsic  curvature  y^-. 


2.  ESTIMATION  ERRORS  FOR  THE  SINGLE  UNIT  NONLINEAR  MODEL 

For  the  general  nonlinear  model  (2.1)  with  continuously 
differentiable  n*  the  least  squares  estimate  §  is  the 
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solution  to  the  normal  equation 


n'<e>(Y  -  n<e>>  *  o.  c 3. i > 

From  this  formula.  Box  (1970)  shows  that  the  error  in  the 
least  squares  estimate  0  Is  given  to  second  order  in  e 
by 

©  -  0  =  +  q  (3.2) 

where  q  Is  quadratic  In  e.  The  derivative  n  is 
evaluated  at  the  true  6.  To  the  order  of  this  approxima¬ 
tion,  the  bias  in  &  Is  given  by  the  expectation  of  q, 
which  Box  derives  as 

b  *  E(q)  (3.3) 

1  *  *  -1*  **  ‘*-12 
«  -  2  i t>>  T)'Ctr(T)  m(r>'T))  o  )3. 

•  •  •  •  • 

Here,  n  _  denotes  the  mth  face  of  n  •  Since  n  is 

evaluated  at  the  true  6,  the  first-order  term  has  zero 

expectation.  Of  course,  for  tj  linear  in  0,  the  first- 

order  term  in  (3.2)  is  the  usual  linear  regression  error 

term,  and  there  are  no  higher  order  terms. 

To  see  the  implications  of  Box's  formula,  we  expand 

t)(6)  and  ti($)  in  a  Taylor  series  about  0.  With  the 

results  given  by  (3.2)  and  (3.3),  this  expansion  yields, 

2 

again  to  0(a), 

E(r|($)  -  ri<©>  > 

«  >  •  • 

Z  ti(0)E(6-0)  +  |E<C(©-0)'T)  b(&-0)3) 


(3.4) 


■  rj(9)E(q)  +  |  E(Ce,n<n'T|)“1»»  a(n'T)>“V cl)  0(o3) 

-  ritTi'^'^'Ctrtn’aitTi^)"1®2)^  +  \  Ctr^^n'n)'1 
#  1  * 

In  (3.4),  P=t)(t}'tj)  tj'  is  the  projection  matrix  onto  the 

tangent  plane  at  6.  Noting  from  (3.2)  that 


E(  (6-0) (0-0)')  =  fn'ri) -1ar2  0(a3), 


(3.5) 


we  find 


E(Ti(§)-ri(e))  (3.6) 

(I-P)Ctr(ti*n  E(  ($-0)  (0-0) ' )  K  n(or3). 

Thus,  the  expected  error  in  the  fit  r)(0>  is  orthogonal  to 
the  tangent  plane  at  T)(9). 

Equations  (3.3)  and  (3.6)  can  be  expressed  in  simpler 
form  by  writing  Box's  bias  approximation  in  terms  of  the 
accelerations  def ined  by  (2.5).  Equation  (3.3)  then 
becomes 


E(0  -  0)  ~  -|(n'T)>“1T)'E(4)  (3.7) 

•  -^(ri'rir^'Etd1  1 ), 

where 


A  ■  (6  -  0) 'h* (0  -  ®> 

is  the  acceleration  in  the  direction  0-9.  Similarly,  the 


orthogonality  condition  (3.6)  can  be  written  as 


Over  the  range  of  values  displayed,  the  expectation 
surface  Is  Indeed  quite  symmetrical,  and  nearly  parabolic. 
Thus,  the  quadratic  approximation  appears  to  be  quite  good. 
The  expected  point  of  the  fit  t)(£)  as  given  by  (3.6)  Is 
Indicated  in  the  figure  by  the  sign  slightly  offset 

from  the  curve,  on  a  line  perpendicular  to  the  tangent  line. 

Above  the  expectation  surface,  against  the  same  tangent 
axis,  the  probability  density  function  is  shown  for  the 
tangential  component  of  the  random  error  e.  The  symmetry 
of  this  density  function,  which  Implies  the  orthogonality 
condition  (3.6),  also  means  that  in  terms  of  a  uniform  coor¬ 
dinate  system  on  the  tangent  plane,  the  parameter  estimate 
Is  unbiased.  That  is,  if  equally- spaced  points  in  ©  were 
mapped  under  Pt)  to  equally- spaced  points  in  the  tangent 

A 

plane,  the  least-squares  estimate  0  would  be  unbiased. 

The  nonuniform  coordinate  system  imposed  on  the  tangent 
line  Is  Indicated  by  the  tick  marks,  representing  evenly- 
spaced  values  of  6.  The  compression  of  these  marks  for 
large  values  of  6  reflects  the  parameter-effects  non¬ 
linearity.  As  a  result  of  this  compression,  an  error  e  In 
the  positive  direction  along  the  tangent  line  will  cause  an 
error  of  much  larger  magnitude  in  0  than  an  equal  error  c 
in  the  negative  direction.  Hence,  despite  the  intrinsic 
symmetry  of  the  surface  r\lQ)  and  the  errors  e,  the 
parameter  estimate  6  is  biased. 


The  bias  induced  by  the  parameterization  can  he  seen  by 
mapping  the  tangent  line  and  its  overlying  density  back  to 
the  parameter  space  ©,  as  shown  in  Fig.  lb.  This  mapping 
corresponds  to  stretching  and  squeezing  the  horizontal  axis 
so  that  the  tick  marks  are  equally  spaced,  with  the  result 

A 

that  the  density  function  for  0  Is  skewed.  Thus,  the 

A 

expected  value  of  0,  as  determined  from  Box's  formula 
(3.3),  is  seen  in  the  figure  to  be  offset  from  the  true 
value  0,  which  Is  the  mode  of  the  distribution. 

The  same  framework  used  to  illustrate  Box's  bias  in 
Flg.l  will  help  to  describe  the  aggregation  effect  developed 
below.  We  first  explore  the  errors  introduced  by  aggrega¬ 
tion,  then  consider  the  more  general  model  misspecif icatlon 
problem  of  which  aggregation  Is  a  special  case. 

£.  T£E  AGGREGATE  NONLINEAR  MODEL 

£•!•  General  Framework 

Section  1  describes  the  aggregation  problem  for  the 
residential  energy  consumption  model  (1.1).  For  the  general 
case,  we  assume  a  population  of  U  different  units,  each  of 
whose  responses  YU  can  be  described  by  (2.1),  but  with 
different  values  0U  of  the  unknown  parameter.  We  cannot 
observe  any  individual  unit's  response,  but  at  each  time  m 
we  observe  the  average  of  the  U  responses.  Thus, 


density  function  mapped  back  to  the  parameter  space  is  skewed,  resulting  in  the 


(4.1) 


with 


and 


y  -  i  z<n<eu>  +  eu> 

u 


■  n  +  e, 


e  -  J  Zeu 
u 


(4.2) 


n  «  i  zn(eu) . 
u  u 

We  define  0  and  D  as  the  population  mean  and  dispersion 
of  parameter  values  6U: 


0  »  §  Z0U 
u 


d  -  h  E(eu  -  6) (6U  -  0)'. 


(4.3) 


(4.4) 


We  could  equally  well  assume  the  0U  to  he  a  random 
sample  from  a  population  with  mean  0  and  variance- 
covariance  matrix  D.  Since  it  is  possible  to  obtain  the 
results  we  are  Interested  in  with  reference  to  the  particu¬ 
lar  Individuals  observed,  l.e.  without  appealing  to  expecta¬ 
tions,  we  do  so.  This  approach  avoids  any  confusion  which 
might  be  caused  by  having  random  units  as  well  as  random 
disturbances  e.  The  expectation  operator  E  will  refer  in 
all  cases  to  expectations  over  the  set  of  possible  distur¬ 
bances  e . 

Since  rj  is  a  function  of  up  to  pU  different  parame¬ 
ters,  and  the  individual  responses  are  unobservable,  we  take 


-13- 


*  simplified  model,  fitting 


Y  -  t)( 0 )  +  e.  (4.5) 
The  least  squares  estimate  0  is  found  by  solving  the  nor¬ 
mal  equation  (3.1) ,  which  can  now  be  written  as 


•n<S) '  <T1  +  e  -  h($))  »  0. 


(4.6) 


i-2*  Asymptotic  &ggESg&£ign  Effect 


By  setting  e  *  0  and  solving  (4.6)  for  0,  we  find 
the  aggregate  value  0A  for  which  the  least  squares  esti¬ 
mate  8  Is  a  consistent  estimator: 


^(^>'(1)  -  h(0A))  *  0. 

A 


That  Is,  the  "asymptotic  estimate”  0' 
point  where  the  discrepancy 


(4.7) 

corresponds  to  the 


Z  ■  T)(  0^)  -T) 

Is  normal  to  the  expectation  surface.  Of  course,  1)0*)  is 
the  point  on  the  expectation  surface  closest  to  n» 

Following  Box  (1971)  we  find  the  difference  6 

between  the  average  parameter  value  0  and  the  asymptotic 
A  — 

estimate  0  by  expanding  t)  In  the  normal  equation  (4.7) 

i 

about  the  asymptotic  6  : 

0  Z  r)'Cn(©A>  +  T| ( y E ( 0U  -  0A)}  (4.8) 

u 
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+  \  J  z<eu  -  eA>'n  <ou  -  eA>  -  ri( eA> 
u 

«  n'CriS  +  H  Z(0U  -  0)'T|*(0U  -  ©> 

L  u  u 

+  \  S'T)*S} 

«  Ti'CriS  +  \  ctrtri*  D)3  +  J  6'ti*S}.. 
z  zn  z 

where 

S  •  0  -  0A. 

Assuming  86'  <<  D,  (4.B)  gives,  to  first  order  in  D, 

o  -  <n'n>«  +  2  n'ctr<n  mD)j.  <4. 9) 

Hence , 

8  Z  -  \  (^'nj'^'Ctrtn  nD>3.  (4.10) 

Note  that  the  approximation  (4.10)  is  valid  provided  the 
value  6  it  yields  satisfies  the  above  assumption  that 
66'  <<  D. 

Recalling  (3.5),  we  see  that  the  aggregation  effect  6 
given  by  (4.10)  is  the  same  a3  the  bias  b  given  by  (3.3), 
with  the  population  variance  D  replacing  the  estimation 

7  •  *  _i 

variance  o  (t)'tj)  .  If  we  adopt  the  shorthand  notation 


a  *  a 

u 


(0u-0) 


»  <  ©u-© ) '  r>*  <  ©U-0 )  1 0 


and  define 


(4.11) 
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»  -  o  *V 
u 


(4.12) 


then  by  arguments  similar  to  those  used  In  the  previous  sec¬ 
tion  we  have 


S ;  -§  (VnrlW 


1  *  *  -1*  -I  | 

-  "2  n'a"; 

t)(6)  -  t)  z  2  a; 

T)(0A)  -  t)  Z  i  a 


(4.13) 


(4.14) 


(4.15) 


Thus,  the  difference  between  t)(0>,  the  model  at  the 
average  parameter  6,  and  the  model  average  n  is  approxi¬ 
mated  by  the  second-order  term  (1/2)  a  (In  (4.14)).  The 
difference  between  the  asymptotic  estimate  ^(0*)  and  ri 
Is  approximated  by  the  normal  component  of  the  acceleration 

term,  (1/2)  a^  (In  (4.15)).  The  aggregation  effect  S  is 

approximated  by  the  coefficients,  with  respect  to  the 

*  1 1 
columns  of  n»  of  the  tangential  component  (1/2)  a’ 1  (In 

(4.13) ).  Note  that  even  if  t|(0)  is  an  exact  quadratic, 

(4.13)  and  (4.15)  are  still  only  approximate,  because  we 

have  Ignored  products  of  first  and  second  derivatives  with 
2 

terms  of  order  S  . 

The  effect  of  aggregation  is  illustrated  in  Fig.  2,  for 
the  same  data  set  displayed  in  Fig.  1.  For  this  illustra¬ 
tion,  we  have  assumed  that  the  random  error  e  ■  0,  and 


taken  the  dispersion  D  of  actual  parameter  values  0U  to 
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be  identical  to  the  dispersion  matrix  of  estimates 

2  *  *  -1  - 
d  from  the  original  example.  The  point  t)  shown 

In  Fig.  2a  Is  that  given  by  the  approximation  (4.15). 

He  have  further  assumed  that  the  values  of  6U  are 
symmetrically  distributed  about  their  mean  value  6.  This 
assumption  is  useful  for  illustration,  and  will  be  natural 
in  many  situations.  Figure  2a  shows  how  the  points  n<0u) 
on  the  expectation  surface  are  asymmetrically  distributed, 
as  a  result  of  the  nonuniform  mapping  from  the  parameter 
space  0.  Thus,  the  average  sample-space  point  n  is  on  a 

A 

line  orthogonal  to  the  surface  at  ti(6  ),  not  at  r)(6). 

The  difference  between  the  projection  and  the  point 
corresponding  to  the  average  ©  is  translated  back  to  the 
parameter  space  in  Fig.  2b. 

Figures  1  and  2  illustrate  the  similarity  between  the 
bias  and  the  aggregation  effect.  Comparing  the  correspond¬ 
ing  formulae  (3.3)  and  (4.10)  shows  that  the  relative  impor¬ 
tance  of  bias  and  aggregation  depends  on  the  relative  magni- 

2  *  *  -i 

tude  of  the  estimation  variance  a  (n'n)  and  the  popula¬ 
tion  variance  D. 

The  estimation  variance  is  conveniently  represented  by 
the  standard  radius  p  defined  by  (2.7).  This  radius  meas¬ 
ures  the  approximately  spherical  spread  of  estimates  rt<®> 
about  their  expected  value:  according  to  the  linear  approxi¬ 
mation 
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(4.16) 


E(  |  T)($)  -  T) ( 0 )  |  2 )  £  pa2. 

nxe  relative  curvature  y  which  incorporates  the 
radius  p  as  a  normalizing  constant  indicates  the  potential 
magnitude  of  the  bias,  but  not  of  the  aggregation  effect. 

The  latter  depends  upon  the  dispersion  of  expectation  points 
T|( 0U) ,  which  is  not  in  general  spherically  symmetric,  even 
to  a  first  approximation.  Hence,  no  single  radius  which 
might  be  defined  by  analogy  with  (4.16)  adequately  describes 
this  spread.  In  comparing  the  bias  with  the  aggregation 
effect,  the  Important  consideration  is  the  spread  of  the 
estimates  compared  to  the  spread  of  population  values,  not 
overall,  but  in  the  directions  of  large  parameter-effects 
curvature . 

When  the  bias  and  aggregation  effects  are  of  comparable 
magnitude,  the  combined  effects  of  dispersion  of  values  0U 
and  nonzero  random  disturbances  e  must  be  considered. 
Fortunately,  as  will  be  seen,  the  two  effects  can  be  treated 
as  additive. 


Finite  Sample  Bias  for  the  Aggregate  Nonlinear  Model 

Equation  (3.3)  gives  Box's  expression  for  the  bias  due 
to  nonlinearity  alone  for  a  nonlinear  least  squares  estima¬ 
tor.  We  will  derive  a  similar  expression  for  the  nonlinear¬ 
ity  bias  in  the  presence  of  model  misspecif ication.  The 
particular  misspecification  we  are  concerned  with  is  the 
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averaging  described  In  Section  4.1.  However,  our  develop 
aent  will  be  somewhat  more  general. 

Assume  as  above 


Y  ■  n  +  e. 

E(e)  =  0. 

Var(e)  »  o2I. 

2  *  n  -  ti(0a>  . 

*  0.  (4.17) 

This  is  the  same  formulation  as  used  above,  but  now  n  may 
be  regarded  as  any  function  which  does  not  lie  on  the  expec¬ 
tation  surface  S  defined  by  r\(.Q) ,  the  function  fit  to 
Y.  He  consider  the  effect  of  e  on  the  least  squares  esti¬ 
mate  of  8*. 

The  normal  equation  (3.1)  can  now  be  written  as 

Ti'<e)(<n  +  e)  -  n<$>>  *  o.  (4.18) 

A 

Expanding  (4.16)  about  6  yields 

0  ■  <T)  +  ($  -  0A)'q  )'  (4.19) 

x(n  +  e  -  (t>  +  n($  -  eA)  +  i(0  -  eA)'n  <o  -  eA)))i  . 

*  0A 

Since 

Y  ■  n(0A)  +  z  +  e. 

It  Is  appropriate  to  treat  the  total  discrepancy  z  +  e  as 
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a  single  quantity  producing  errors  In  the  estimate  of  6  . 


Thus,  following  Box,  we  assume 


8  -  eA  :  ciw)  +  q. 


(4.20) 


t&ere  C  is  a  pxM  matrix  and  the  p-dlmenslonal  q  Is 
quadratic  In  z  +  e.  Equation  (4.17)  then  becomes  (evaluat¬ 


ing  all  derivatives  at  0  ) : 


0  *  (T)  +  (z+e)'C'ri  +  q'Tj  )' 


(4.21) 


x(z+e  -  TjC(z+e)  -  nq  -  ,(z+e) 'C'tj  C(z+e) 


+  0( (z+e)  ) . 

Equating  to  zero  successive  powers  of  z+e ,  we  have 


(tj'  -  T)'T)C)  ( z+e)  =  0 


so  that 


(4.22) 


c  -  (n'n)  V. 


To  second  order. 


0  »  (z+e)'C'Ti  '(I-P)(z+e)  -  Ti'rjq 


-S. 


jT)'  ( z+e)  'C'ti  C( z+e) 


(4.23) 


(4.24) 


■  e'C'Tj  'z  +  e'C'T)  *  (I-P)z  -  Ti'rjq 
-  Tt|' e'C'rj  Ce. 


Hence , 


q  ■  (n'n)"1(e'C'rj  'z  +  e'C'rj* '  (I-P)e 


(4.25) 


-  ^rj'e'C' T}  Ce)  . 

Box  has  shown  that  the  second  term  in  parentheses  has  zero 
expectation.  Thus,  to  the  order  of  our  approximations,  the 
bias  is  given  by 

E(fi  -  0A>  ~  E(C( z+e)  +  q)  (4.26) 

*  E(q) 

1  *  *  -1*  **  **-12 
-  -  2  (T''T^)  h'Ctrd)  m(n'n)  a  >3 

which  is  Identical  to  (3.3).  That  is,  the  first  approxima- 

A 

tlon  to  the  random-error  induced  bias  of  0  is  the  same  as 
for  the  correctly  specified  model.  The  result  is  reasonable: 
(3.3)  shows  the  bias  has  no  dependence  on  the  error  com¬ 
ponent  normal  to  the  tangent  plane  at  n(®>r  and  the  effect 
of  mlsspecifl cation  is  to  add  a  normal  error  component  z 
to  r)(0A). 

Thus,  to  first  order,  the  effects  of  aggregation  and 
random  errors  are  additive.  Combining  (3.3)  and  (4.10)  we 

A 

have  the  expected  error  in  the  aggregate  estimate  0-0  as 

E($  -  6)  Z  h  Ctr(r»’  <D  -  a2(n,r)>"1  >  >3.  (4.27) 

4  m 

A 

The  asymptotic  estimate  9  corresponds  to  the  point 
t)(6A)  on  the  expectation  surface  S  which  comes  closest 
to  hr  the  expected  value  of  the  aggregate  T.  The  point 
t)(8A)  is  Invariant  under  reparameter i rations,  which  is  not 
true  of  n<0>*  For  many  purposes,  then,  the  consistently 
estimated  8A  and  not  the  population  average  6  better 
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characterizes  the  aggregate.  On  the  other  hand,  the  popula¬ 
tion  mean  9  may  have  a  more  natural  Interpretation  than 

A 

the  asymptotic  estimate  6  ,  which  is  defined  only  in  terms 
of  the  regression  model.  In  such  a  case,  the  net  effect 
represented  by  (4.27)  is  of  interest. 

5.  BIAS  ESTIMATION  FOR  NONDIFFERENTIABLE  MODELS 

5. A.  Considerations  for  General  Nondif f erentlable  Models 

The  explicit  formulas  developed  in  the  previous  sec¬ 
tions  to  estimate  systematic  effects  apply  only  to  smooth 
models.  We  will  consider  first  how  to  apply  Box's  bias  for¬ 
mula  to  a  segmented  model.  Similar  reasoning  will  then 
extend  the  aggregation  effect  formula  (4.10)  to  the  energy 
model  which  motivated  its  development. 

The  bias  given  by  (3.3)  is  based  on  a  quadratic  approx¬ 
imation  fl  to  the  model  function  r^.  In  fact,  (3.3)  may 
be  Interpreted  as  the  bias  in  the  least  squares  estimate  0 
for  the  model  function  t}.  This  bias  estimate  is  also  valid 
for  the  estimate  ©  for  the  original  function  n  to  the 
extent  that  ft  approximates  r|  over  the  region  of 
Interest. 

As  described  above,  the  approximation  is  found  from  a 
Taylor's  series  expansion  of  h*  However,  Box's  derivation 
is  equally  valid  for  any  quadratic  approximation  TJ  such 
that 
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(5.1) 


n(e>  =  rue)  ♦  ^ 

and  wherever  r)  is  defined 

ri<e>  »  i\  +  r2 

where  the  remainders  R,  and  R„  are  of  small  order.  For 

1  2 

an  arbitrary  fl,  it  may  be  difficult  to  prove  that  R^ 
and  R^  are  small  enough  to  be  neglected  in  the  derivation 
described  in  the  previous  section.  But  if  the  approximate 

model  f)(6)  and  derivative  tj  are  close  to  the  original 

*  A/ 

tj(6)  and  ti<3)  over  a  wide  region,  and  the  estimate  0 

A 

for  the  approximate  model  is  close  to  the  estimate  0  for 
the  original,  then  the  bias  estimated  for  0  should  give  a 

A 

good  indication  of  the  bias  in  0. 

Box's  derivation  makes  use  of  the  fact  that  the  least 
squares  estimate  for  a  continuously  differentiable  model  is 
a  solution  to  the  normal  equation  (4.6).  For  the  nondif- 
ferentiable  model,  however,  least  squares  estimates  that 

a 

fall  at  points  of  discontinuity  of  ~n  are  in  general  not 
solutions  to  the  normal  equation.  Nevertheless,  the  argu¬ 
ments  just  given  apply  equally  well  to  a  nondiff erentiable 
model.  That  is,  with  the  interpretation  of  (3.3)  as  the  bias 
in  S  for  the  approximate  model.  Box's  formula  requires 
only  that  §  be  a  solution  to  the  normal  equation  for  the 
approximate  model. 

To  apply  this  formula  to  a  particular  nondiff erentiable 
model  of  interest,  then,  we  must  produce  a  quadratic  approx- 
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imation  which  Is  close  to  the  original  model  r)  over 
any  region  of  interest,  and  for  which  the  least  squares 

#<  A 

estimate  0  is  close  to  the  original  estimate  0.  The  prob¬ 
lem  of  constructing  such  an  approximation  is  next  addressed 
for  the  energy  model  (1.1). 

APP.Us.fttl.9n3  to  &£  Energy  Model 

A  quadratic  approximation  to  the  energy  model 
al  +  0X(t),  which  is  nondlfferentlable  with  respect  to  one 
parameter  t  only,  was  found  simply  by  fitting  a  quadratic 
to  X(x),  coinciding  with  X(t)  at  t  ■  ts 

Z  <T.)  *  X  (t)  +  b  (T  -t>  +C  (T  -t)2.  (5.2) 

m  1  m  mi  mi 

The  approximation  X  then  replaced  the  actual  degree-day 
variable  X  in  (1.1). 

For  each  month  m,  the  coefficients  b_  and  c_  were 

3u  31 

found  by  the  method  of  least  squares  for  a  set  of  evenly 
spaced  reference  temperatures  t^.  Details  of  he  fitting 
procedure  are  described  in  previous  work  by  the  author 
(Goldberg,  1982  and  1983).  In  all  cases,  the  smooth  X(t) 

and  derivative  X(*r)  were  quite  close  to  the  original  X(t) 
and  X(t)  over  a  range  of  values  of  t  corresponding  to 
several  standard  errors  of  the  estimate  *t. 

Applying  the  bias  and  aggregation  effect  formulae  (3.3) 
and  (4.10)  to  the  energy  model  (1.1)  for  regional  aggregates 
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requires  a  modification  even  before  the  smoothing  just  dis¬ 
cussed.  The  reason  Is  that  meters  are  read  on  different 
days  throughout  the  month.  Hence,  when  (1.1)  Is  averaged 
over  a  utility  region  or  state,  the  degree-days  X^t1*)  for 
different  households  u  correspond  to  different  one-month 
periods,  as  well  as  to  different  reference  temperatures  tu. 

To  account  for  this  billing  lag,  the  degree-day  variable  X_ 

m 

for  utility  or  state  aggregates  in  month  m  is  defined  as 


.(t) 


(5.3) 


m-1 


N 


Z  j  (t  -  T  ,).  +  Z  <N  -  j)  (x  -  T  . ) 

na _ » _ ill 

Vi  "» 

Z  j  +  I  (N  -  j) 
j-1  j-1  ® 

Total  monthly  sales  per  household  Ym  is  then  fit  to  (1.1) 
with  Xjg  defined  by  (5.3)  rather  than  (1.2). 


Equation  (1.1)  was  fit  by  this  method  for  a  total  of  75 
different  aggregate  data  sets,  corresponding  to  different 
combinations  of  utility  regions  and  time  periods.  Each 
aggregate  was  for  residential  gas  heating  customers  only. 
Details  are  given  in  Goldberg  (1982  and  1983). 


Bias  Estimates  LSI.  £b£  3fflgg£hSd  SlgrgZ  Model 

The  effect  of  using  the  quadratic  approximation  X  in 
place  of  X  in  (1.1)  is  summarized  in  Table  1  for  the  75 
aggregate  data  sets.  The  table  shows  the  median  bias  b 
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Table  1.  MEDIAN  NONLINEARITY  BIAS  b  FOR  75  AGGREGATE  DATA 


SETS  FIT  TO  THE  ENERGY  MODEL 


Bias 

Smoothing  Error 

Standard  Error 

Parameter  0^ 

b  =  £(0^  -  9  ^  ) 

A 

9-0 
j  j 

A 

s.e .  (0_j ) 

BASELOAD  a 
( Th/cu-d ) 

-0.003 

-0.002 

0. 108 

HEAT  RATE  6 
(Th/cu-«Fd) 

0.0003 

-0.0001 

0.0089 

REFERENCE  TEMPERATURE  T 
(»F) 

0.01 

0.02 

1.27 

NORMALIZED  ANNUAL 
CONSUMPTION  T 
(Th/cu-yr ) 

0 

30 

20 

The  bias  b  was  computed  for  the  smoothed  energy  model  using  (3.3).  Shown 
for  each  quantity  is  the  median  value  found  in  75  fits  of  aggregate  data  sets 
to  (1.1).  Abbreviations  are  Th  for  Therms ,  cu  for  customer ,  d  for  day , 
and  yr  for  year. 
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mi  mi 

for  the  parameter  estimate  0  using  X.  Also  shown  <  re 
the  median  soothing  error  (the  difference  between  0  and 
the  estimate  0  for  the  original  model)  and  the  median 
standard  error  for  %  based  on  a  linearization  of  the  model 
(1.1). 

mi 

The  bias  formula  for  normalized  annual  consumption  r 
was  obtained  by  re-expressing  the  model  r)(a,0,T)  as 

ce  (r,3,x)i  *  cr  +  ptx  (t)  -  x  (t)3, 

la  mo 

then  applying  (3.3).  To  calculate  normalized  annual  con¬ 
sumption  f  for  the  approximate  model,  the  degree-day  norm 

X  was  also  smoothed  around  t  via  (5.2). 
o 

For  all  four  parameters  a,  0,  t,  and  T,  Table  1 

shows  that  the  difference  between  the  estimate  #  for  the 

smoothed  model  and  0  for  the  original  is  of  the  same  order 

of  magnitude  or  greater  than  the  nonlinearity  bias  estimate 
*/ 

for  0.  On  the  other  hand,  both  the  error  due  to  using  the 
approximation  and  the  bias  for  the  approximate  model  are 
much  smaller  than  the  standard  error  in  all  cases.  Thus, 

w  A 

even  talcing  the  absolute  difference  between  0  and  0  as 

A 

an  upper  bound  on  the  magnitude  of  the  bias  in  0,  this 
bias  will  be  inconsequential  for  most  purposes.  In  particu¬ 
lar,  neither  comparisons  between  engineering  calculations 
and  parameters  found  empirically  by  fitting  (1.1)  nor  com¬ 
parisons  among  empirical  measurements  will  be  distorted  by 
the  least  squares  estimation  procedure. 
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It  has  generally  teen  known,  and  was  demonstrated  by 


Box  in  producing  (3.3),  that  nonlinearity  biases  are  a 
second  order  effect  compared  with  the  standard  error  of  the 
estimate  in  a  least  squares  regression.  However,  both  the 
conventional  wisdom  and  Box's  result  were  based  on  smooth 
models.  Here,  we  have  shown  that  they  are  essentially  valid 
for  the  nondifferentlable  model  of  the  present  study.  The 
methodology  used  Is  generallzable  to  other  nondifferentlable 
models. 


5-1.  Aggregation  Effects  for  the  Regional  Aggregate 

To  apply  the  aggregation  formula  (4.10),  we  need  not 
only  a  smooth  expectation  function  nr  But  also  an  estimate 
of  the  population  dispersion  matrix  D.  For  the  parameters 
of  the  energy  model  (1.1),  an  estimate  of  D  was  based  on 

the  estimated  parameters  0  from  a  sample  of  71  Individual 
gas-heated  houses  In  New  Jersey.  As  described  by  Dutt  et  al 
(1982),  these  houses  were  selected  to  represent  different 
housing  types  In  the  state.  The  aggregation  formula  as 
applied  to  the  energy  model  and  the  estimation  of  the 
dispersion  matrix  O  for  this  case  are  discussed  more  fully 
in  Goldberg  (1982). 

Table  2  summarizes  the  results  of  applying  the  aggrega¬ 
tion  formula  (4.10  )  to  the  energy  model  (1.1).  The  table 
shows  the  median  estimated  difference  S  between  each 


Table  2.  MEDIAN  AGGREGATION  EFFECT  6  FOR  75  DATA  SETS  FIT 
TO  THE  ENERGY  MODEL 


Parameter  8 . 

3 

Aggregation 

Effect 

*i  ‘  *i  -  e5 

JuS 

s.e. (0^ ) 

Least  Squares 
Estimate 

•j 

BASELOAD  a 
(Th/cu-d) 

-0.037 

-0.30 

1.35 

HEAT  RATE  8 
(Th/cu-#Fd) 

0.0033 

0.44 

0.224 

REFERENCE  TEMPERATURE  T 

CF) 

-0.28 

-0.25 

64. 1 

NORMALIZED  ANNUAL 
CONSUMPTION  r 
(Th/cu-yr ) 

0 

-0.01 

1543 

The  aggregation  effect  6  was  computed  for  the  smoothed  energy  model  using 


(4.10).  See  caption  to  Table  1. 


population  mean  oc,  0,  t,  and  r  and  the  corresponding 
aggregate  parameter  ,  for  the  same  group  of  aggregate  data 
sets  summarized  In  Table  1.  As  a  point  of  reference,  the 
median  value  found  for  each  coefficient  Is  also  Indicated. 
The  table  shows  all  the  aggregation  effects  to  be  quite 
small  compared  with  the  typical  parameter  values. 

The  second  column  of  the  table  shows  the  median  ratio 
of  the  aggregation  effect  to  the  standard  error  of  the 
aggregate  estimate.  From  this  comparison.  It  Is  seen  that 
the  estimated  difference  between  the  aggregate  0A  and  the 
population  mean  0  Is  generally  small  compared  with  the 

*A 

uncertainty  In  the  aggregate  estimate  0  ,  though  not  as 
small  as  the  biases  summarized  in  Table  1. 

a 

As  noted  in  the  previous  section,  the  aggregate  0 
may  be  a  more  meaningful  summary  statistic  for  the  popula¬ 
tion  than  is  the  mean  0.  It  is  extremely  important,  how¬ 
ever,  that  the  results  of  scorekeeping  analyses  be  easily 
understood.  The  close  correspondence  between  the  coefficient 
0*  estimated  by  the  aggregate  analysis  and  the  population 
mean  0  Is  therefore  an  Important  feature  of  the  energy 
analysis  methodology. 


Comparison  of  Empirical  and  Theoretical  Aggregation 

Effects- 
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We  have  just  seen  that  the  aggregation  formula  (4.10) 
allows  us  to  give  a  simple  interpretation  of  the  aggregate 
parameter  estimates  for  (1.1).  Another  application  of  the 
formula  is  in  justifying  a  shortcut  approach  for  large-scale 
retrofit  programs.  If  essentially  equivalent  results  might 
he  obtained  by  analyzing  aggregate  data  only,  rather  than 
analyzing  each  house  individually,  measurement  costs  might 
be  cut  significantly.  It  should  be  noted  that  the  primary 
cost  savings  would  come  from  the  reduction  in  data  collec¬ 
tion  and  handling  required,  rather  than  in  computation. 

Above,  we  found  for  the  state  aggregate  that  the 
difference  between  the  theoretical  aggregate  parameters  0A 
and  the  population  means  6  was  slight.  For  normalized 
annual  consumption  T,  the  parameter  of  major  interest  In 
assessing  a  retrofit  program,  the  aggregation  effect  was 
typically  less  than  one  tenth  as  large  as  the  standard 
error,  and  always  less  that  one  half  as  large. 

To  test  the  aggregation  formula  empirically,  and  to 
determine  whether  similar  results  would  be  found  for  a  small 
group  of  houses,  (4.10)  was  evaluated  for  one  of  the  modules 
of  the  Modular  Retrofit  Experiment  described  by  Dutt  et  al 
(1982).  The  results  are  shown  in  Table  3.  Also  shown  in 
this  table  is  the  empirically  determined  aggregation  effect, 

*A 

the  difference  between  the  estimate  0  for  the  aggregate 

and  the  average  d  of  the  estimates  for  the  17  individual 
houses. 


-32- 


Table  3.  COMPARISON  OF  THEORETICAL  AND  EMPIRICAL  AGGREGATION  EFFECTS 


The  theoretical  estimate  of  the  aggregation  effect 
agrees  with  that  found  empirically  only  In  order  of  magni¬ 
tude.  Recall,  though,  that  we  are  comparing  the  theoretical 

estimate  of  the  expected  difference  E(S^-Q)  between  the 
aggregate  and  mean  parameters  for  the  smoothed  model  with 

aA  A 

the  empirical  difference  ©  -6  for  the  original.  In  addi¬ 
tion,  we  have  only  a  rough  estimate  of  the  diperslon  D  of 
actual  parameter  values  8U  for  the  17  houses.  Thus,  the 
basic  agreement  between  the  theoretical  and  empirical  values 
is  quite  reassuring. 

More  importantly  in  terms  of  energy  analysis,  both  the 
theoretical  and  the  empirical  results  indicate  only  slight 
differences  between  the  aggregate  parameter  and  the  popula¬ 
tion  mean.  The  differences  are  on  the  order  of  10%  as  large 
as  the  median  standard  error  of  Individual  estimates.  That 
is,  the  effect  of  aggregating  Is  small  compared  to  the 
uncertainty  In  Individual  estimates.  For  the  index  T,  the 
difference  between  the  mean  and  aggregate  values  is  less 
than  4%  of  the  aggregate  standard  error  and  less  than  0.2% 
of  the  aggregate  estimate. 

The  standard  deviation  of  estimates  T  across  the 
population  Is  much  larger  than  the  standard  error  of  the 

aggregate  estimate  r*.  That  is,  while  the  aggregate  parame¬ 
ter  is  very  well-determined,  the  aggregate  analysis  alone 
does  not  indicate  the  variability  of  r  within  the  popula- 
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tlon.  Relying  solely  on  aggregate  data  would  preclude 
obtaining  such  information.  However,  if  the  mean  value  r 
is  the  only  parameter  of  Interest  the  aggregate  analysis  is 
entirely  sufficient. 


&.  CONCLUSIONS 

We  have  seen  that  the  bias  b  for  the  single-response 
case  and  the  asymptotic  aggregation  effect  S  are  very 
similar,  with  the  estimation  variance  and  population  vari¬ 
ance  of  the  true  values  playing  parallel  roles.  Both  the 
aggregation  effect  and  Box's  bias  are  induced  by  the 
parameter-effects  nonlinearity;  in  the  case  of  bias,  a  sym- 

A 

metric  distribution  of  fits  T)(0)  translates  into  an  asym- 

A 

metric  distribution  of  estimates  0,  while  in  the  aggrega¬ 
tion  case,  a  symmetric  distribution  of  true  parameter  values 
translates  into  an  asymmetric  distribution  of  points  n(9U>» 
When  the  bias  and  aggregation  effect  are  of  comparable  mag¬ 
nitude,  the  two  can  be  treated  as  additive. 

An  Important  difference  between  the  bias  and  the  aggre¬ 
gation  effect  is  their  behavior  as  the  number  of  observa¬ 
tions  Increases.  For  the  bias,  the  estimation  variance 

2  *  *  -i 

o  (n'h>  decreases  with  increasing  sample  size.  As  a 


result,  the  magnitude  of  the  bias  decreases.  Furthermore, 
the  accuracy  of  Box's  approximation  (3.3)  based  on  the 
assumption  of  small  e.  Improves. 


For  the  aggregation  effect,  on  the  other  hand,  the 

dispersion  D  of  true  values  0U  is  fixed.  Therefore, 
Increasing  the  sample  size  will  neither  improve  the  accuracy 
of  the  approximation  (4.10)  nor  reduce  the  magnitude  of  the 
aggregation  effect.  An  important  consequence  is  that  in  a 
sufficiently  large  sample  the  aggregation  effect  may  be  com¬ 
parable  in  magnitude  to  the  standard  error  of  the  estimate. 
By  contrast,  to  the  accuracy  of  the  second  order  approxima¬ 
tion,  the  bias  is  always  of  smaller  order  than  the  standard 
error.  Thus,  while  the  bias  caused  by  nonlinearity  can  rea¬ 
sonably  be  Ignored  in  most  applications,  the  effect  of 
aggregation  may  be  quite  Important. 
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